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. . Top quarks produced in multi-TeV processes will have large Lorentz boosts, and their decay 

P^' products will be highly coUimated. In semileptonic decay modes, this often leads to the merging 

JlLi' of the 6-jet and the hard lepton according to standard event reconstructions, which can com- 

r-| \ plicate new physics searches. Here we explore ways of efficiently recovering this signal in the 

muon channel at the LHC. We perform a particle-level study of events with muons produced 

inside of boosted tops, as well as in generic QCD jets and from PF-strahlung off of hard quarks. 

We characterize the discriminating power of cuts previously explored in the literature, as well 

^ . two new ones. We find a particularly powerful isolation variable which can potentially reject 
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light QCD jets with hard embedded muons at the 10^ level while retaining 80~90% of the 
Q ■ tops. This can also be fruitfully combined with other cuts for 0(1) greater discrimination. For 



W^-strahlung, a simple p^-scaled maximum AR cut performs comparably to a highly idealized 
top-mass reconstruction, rejecting an 0(1) fraction of the background with percent-scale loss 
'k>( I of signal. Using these results, we suggest a set of well-motivated baseline cuts for any physics 
analysis involving semileptonic top quarks at TeV-scale momenta, using neither 6-tagging nor 
missing energy as discriminators. We demonstrate the utility of our cuts in searching for res- 
onances in the ti invariant mass spectrum. For example, our results suggest that 100 fb~^ of 
data from a 14 TeV LHC could be used to discover a warped KK gluon up to 4.5 TeV or higher. 
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I. INTRODUCTION 

The LHC promises our first glimpse of top quarks produced at energies far above threshold. 
There has been much speculation about top's role at these energies, given its large coupling 
to the sector that breaks electroweak symmetry. For example, models which complete the 
electroweak sector with strong dynamics often contain a rich spectrum of heavy composite 
resonances with large branching fractions to top quarks. Given the constraints on these models 
from flavor and electroweak precision tests (see e.g., |l|, l2|), as well as from direct searches at 
the Tevatron [31,^], it is generally expected that the lowest-lying composite states must live in 
the multi-TeV mass range. Top quarks produced in the decays of such massive particles will be 
so energetic and highly Lorentz-boosted that they effectively inhabit a new kinematic regime, 
where all of an individual top's decay products can become beamed into a localized region of 
the detector — a "top-jet." Our ability to discover hints of compositeness within the electroweak 
sector, or to place significant constraints on it, may therefore depend on how reliably we can 
reconstruct these highly boosted tops. More generally, final states with boosted tops will serve 
as a probe of any new multi-TeV-scale physics that can couple to top quarks. 

Reconstructing tops at high boost comes with special challenges, since the 0(1) angles be- 
tween decay products in the top rest frame to shrink to AR ~ 0.1 in the lab. Conventional 
searches will often group together two or more of these particles into a single jet, losing in- 
formation on internal kinematics. Further complicating matters, fe-jets face degraded tagging 
efficiency due to the high density of nearly-collinear tracking hits in the inner detectors (see, 
e.g., (5|). Missing transverse energy may also be difficult to utilize for detailed top kinematic re- 
construction since it is nearly aligned with the lepton and the fe-jet, and is therefore particularly 
sensitive to fluctuations in visible energy measurements. 

In a previous publication [6|], we and our collaborators demonstrated that these difficulties 



3e overcome for the case of boosted tops that decay hadronically. (For similar 



can plausibly 

ideas, see also |7l-lllj].) The three jets produced in the top's decay merge according to traditional 
jet definitions, but still leave patterns in the calorimeter that can be discriminated from jets 
initiated by light quarks or gluons. This calorimeter-based "top-tag" potentially outperfoms 
6-tagging for px ~ TeV, achieving efficiencies similar to 6-tagging at much lower px- This opens 
up the possibility of performing a search for resonances in the ti mass spectrum utilizing the 
all-hadronic decay channel, independent of whether 6-tagging can be made to work effectively 
at TeV-scale momenta. For recent studies of these techniques in full detector simulation at 



CMS, see [l2Ml4|. 



Boosted tops that decay semileptonically should be easier to identify, as they feature hard 
leptons and missing energy. Indeed, most work to date on searching for multi-TeV ti resonances, 
such as IsHlSl, has focused on the /+jets channel because it is generally considered to be an 
optimal compromise between signal rate and background discrimination. However, at high top 
boost we inevitably face the question of how to handle leptons that technically live inside of 



jets. This is essentially a new problem for multi-TeV machines, since earlier colliders such as 
the Tevatron would produce leptons in jets mainly through heavy flavor decays or by geometric 
accidents (or fakes). Traditionally, non-isolated leptons are only considered for use in ^-tagging, 
and not as independently reconstructable objects. 

This issue has been dealt with in several ways in the theory literature. The investigation of 



top resonances in 18|, ll9| used conservative lepton isolation criteria, but at significant cost of 
signal efficiency at the highest masses. In [16|, a lepton could either be isolated or have a high 
invariant mass when combined with its host jet. In J8|, a suite of simple kinematic cuts were 
developed to help identify leptons "stuck" inside of jets. 

In this paper we investigate the discriminating power of many of the previously suggested 
variables, as well as two new, especially simple and powerful ones. From this investigation, 
we propose a minimal set of cuts for discriminating semileptonic boosted tops from their two 
major physics backgrounds: heavy flavor pair-production within jets and IV-strahlung off of 
hard quarks. We focus on the relatively clean case of muons. We flnd that the heavy flavor 
background can be very efficiently removed with a novel tracking isolation cut, tallying tracker 
energy in a small isolation cone that shrinks with muon px- This variable is sensitive to the 
nearby showering and decay products of high-pr bottom and charm quarks. This becomes 
more powerful in combination with a cut on the muonic mass-drop variable x^, or on Ai?;,/^, 
both of which where investigated in [8|]. In principle, this combination allows a rejection of 
QCD jets with embedded leptons at the level of 10^, and QCD jets in general at the level of 
10^ ~ 10^, with only 0(10%) loss of signal. For VT-strahlung, we flnd that a simple cut on the 
PT-scaled AR between the muon and the closest jet performs essentially as well as an idealized 
top invariant mass cut using a perfectly measured neutrino three-vector. An 0(1) rejection of 
PF-strahlung events can be achieved with a loss of a few percent of the signal. Throughout, 
we avoid utilizing any parametrization of 6-tagging, and conservatively avoid using ^t for the 
construction of discriminator variables. 

As a test case for our methods, we investigate signals and backgrounds for chiral ti reso- 
nances in the /i-|-jets channel. With our cuts, in combination with modest additional cuts on 
the hadronic side, the heavy flavor background is brought down to a level roughly 1 ~ 2 or- 
ders of magnitude below the irreducible ti background. The VT-strahlung background remains 
important, becoming dominant above about 2.5 TeV invariant mass if no additional discrim- 
ination methods are used. Using the full hadronic top-tag of [6|, I^-strahlung can be made 
completely subdominant, but at an additional 0(1 ~ 10) cost in signal efficiency. We also 
demonstrate that top polarization information, encoded in the relative muon momentum, can 
still be utilized with our semileptonic cuts. 

In section mi we discuss the physics motivation and estimate the performance of discriminator 
variables useful for eliminating jets containing heavy flavor. In section llllt we discuss W- 
strahlung and how to efficiently discriminate against it. Section HVl presents the backgrounds 
in the ti invariant mass spectrum using our methods, and estimates discovery reach for some 



simple models using a nominal set of cuts. We present conclusions in section |Vl 

II. HEAVY FLAVOR 

A. Leptons inside of jets 

The lepton and the fe-jet generated in the semileptonic decay of a boosted top will often 
overlap according to standard event reconstructions. More specifically, for left/right chirality 
top quarks produced at px — I TeV, the lepton and the b quark will be within AR = 0.4 
of each other approximately 44/66% of the time.^ For p^ — 2 TeV, this increases to 86/93%. 
Although the application of standard isolation criteria is a simple way to ensure reliability of the 
reconstructed leptons, the resulting low efficiency and polarization bias are major drawbacks. 
Here we will explore what is possible using leptons that are non-isolated according to traditional 
measures. 

Leptons found inside of jets have traditionally been considered unusable as independent 
objects. Non-isolated leptons are produced in the decays of hadrons containing heavy quarks, 
and are in fact used as a standard heavy flavor tag. Heavy flavor may be produced either 
promptly in the hard collision or from gluon splittings in the parton showers, the latter becoming 
progressively more common in light QCD jets at higher energies.^ In addition, there may also 
be contributions from decays of light mesons. Instrumentation and material effects present 
further complications. 

Electrons are particularly difficult to identify because they can look similar to tt^'s after 
accounting for electromagnetic showering in the inner tracking material. It is not clear how 
difficult this discrimination will be in the crowded environment of a TeV-scale jet. Understand- 
ing this issue requires detailed detector simulations and/or actual data, and therefore we defer 
investigation of electrons to the experimentalists (see, e.g., |22|). Hard muons inside of jets, on 
the other hand, are much less susceptible to instrumental fakes. Moreover, we anticipate that 
any fake muons will be largely eliminated by our procedures outlined below for dealing with 
physics backgrounds. Here we only consider backgrounds with real muons. 

Muons coming from bottom and charm decays have a number of characteristics that distin- 
guish them from muons originating from top decays. The obvious difference is that the mass 
scale between the muon and its accompanying jet is controlled by rrit in the case of top decay. 



^ We obtained these numbers from chiral tt resonance samples at parton-level using the TopBSM package in 



MadGraph/MadEvent 4.4.13 [20|, l21 1 



" This suggests that some care must be taken if &-tags are ultimately used to discriminate semileptonic boosted 
top candidates. QCD jets containing leptons are already enriched with heavy flavor. Therefore the main 
utility of a 6-tag on the semileptonic top would be to suppress the Ty-strahlung background. As noted above, 
we will not explore 6-tags here. 



whereas it may be much smaller for jets with heavy mesons. In addition, there are discrimi- 
nators which can be phrased more geometrically. For a top decay at rest, the muon is largely 
uncorrelated in direction with the 6-jet particles. Consequently, while the muon and 6-jet may 
end up close together in AR in the lab frame, generally there is a gap between them. This 
gap is characterized by the inverse of the boost, 0{mt/pTt)- The analogous quantity in heavy 
meson decay is mi^/pTb or rric/pTci which is smaller unless the meson is relatively soft. Another 
important difference can be found in the shower generated by the original hard parton. For top, 
most of this radiation lives outside of the "dead cone," again characterized by Ai? ~ rrtt/pTt- 
The muon still typically remains isolated at this angular scale, even accounting for the addi- 
tional radiation generated before the top's decay. For bottom and charm quarks, the shower 
instead continues to the potentially much smaller angles rrib/pTb and rric/pTc- Therefore, a 
muon produced in heavy meson decay will be sitting within a cloud of its sister decay products 
and particles produced in the preceding parton shower, whereas a muon produced in top decay 
will be approximately isolated out to Ai?~ mt/prt- 

Several other useful observations about QCD background muons were made in 

• The fraction of the total visible jet energy carried by the muon, z^, is typically small. 
Most heavy quarks within TeV-scale jets are produced late in the shower, and then radiate 
further before hadronization. On average, the muon then carries away approximately 1/3 
of the already modest heavy quark energy. 

• The muon is typically very well-aligned with the center of the (6-)jet: ARf,^ <^ 1. This 
is simply due to the collinear enhancements within the parton shower, and the fact that 
the heavy mesons within the jet are highly boosted. Indeed, criteria such as A/??,^ > 0.4 
are common prerequisites for muon reconstruction. However, for very high-p^ top-jets, it 
may be useful to consider the effectiveness of cuts at smaller AR^^. 

• Removing the muon from the jet typically results in a very small mass-drop. 
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where mi, and m.^,^ are the mass of the 6-jet candidate with/without the muon included. 
This is to some extent a combination of the previous two discriminators. However, because 
x^ uses the jet's mass, it is also sensitive to the distribution of the jet's consituents. For 
instance, even a relatively hard, wide-angle muon is usually accompanied by correlated 
hadrons, tending to push x^ toward smaller values. 



The authors of [8j] explored the discriminating power of these variables (z^, ARh^, x^j) for jets 
initiated by prompt heavy flavor production, with visible mass of the muon-jet system, mb^, 
above 100 GeV. (They also studied the effect of varying an upper cut on mf,^, which we do not 
consider here.) 



A priori, the best combination of cuts is not obvious. Here, we study the performance of 
a handful of variables for discriminating generic QCD jets from genuine boosted semileptonic 
tops. Our analysis includes the three variables of |8| described above. We also scan over the 



invariant mass mfe^, for which fixed cuts have been considered originally in 16|], and also in 
[8|. Finally, we consider a novel "mini-isolation" cut at tracker level inspired by the geometric 
observations above, and which we now describe. 

To form an isolation variable, we must define a cone size. In top decay, the separation 
between the muon and fc-jet scales inversely with the top-jet pt, suggesting that we take a 
cone size with this scaling. Alternately, we may attempt to capture the decay products of 
a hypothetical heavy flavor parent, which ideally would use a cone scaling inversely with the 
heavy meson momentum. More realistically, we can use the pt of the muon itself as a rough 
tracer. We found that the latter choice results in 0(1) better discrimination, since it also acts 
in part like a cut on the muon hardness. Softer muons are given larger isolation cones, and are 
consequently more difficult to isolate. Specifically, we find that a cone size 

15 GeV SrriB ,^.. 

Riso = ^ (2) 

works well, and we take this to be our nominal cone definition. From this, we define an isolation 
variable 

mini-iso = —, (3) 

PTcone 

where the denominator scalar-sums the pxs of all charged particles with p^ > 1 GeV in the 
cone, including the muon.^ We emphasize that this specific choice has been only very coarsely 
optimized, and that finding the best cone merits further study under more realistic conditions.^ 



B. Event simulation and reconstruction 

To get an estimate of the production of muons through QCD processes, we study generic 



dijet event samples generated with PYTHIA 6.4.15 |23| and HERWIG 6.510 |2J], with default 
settings in 14 GeV pp collisions. The simulations implicitly include both prompt and radia- 
tive production of heavy fiavor, and the PYTHIA samples also include decays-in-fiight of light 



mesons. We do not assume that these represent fully trustworth representations of the physics 



■^ The 1 GeV tracking cutoff roughly models the critical pT for spiral-out in the ATLAS and CMS magnetic 

fields. 
^ It will also be important to determine at what point precision tracking actually breaks down due to crowding 
of hits. It is possible that some of the muons removed by our isolation cut under idealized conditions can be 
rejected in the real experiments simply due to poor tracking in the inner detectors. For muons from genuine 
boosted top decays, which tend to be better isolated, we expect that this tracking breakdown will be much 

less likely. 
^ We allow particles to decay within a cylindrical volume of half-length 4m and radius 2m. 



in this untested energy region, but we will see shortly that the effectiveness of our cuts allows 
us a large margin of error. 

We compare these to signal samples consisting of decayed color-octet spin-1 bosons with 
pure left- and right-chirality couplings to tops, generated with the TopBSM package of 



MadGraph/MadEvent 4.4.13 (+PYTHIA) [20|,l2]J. For completeness, we also include Wjj ('W 



strahlung") simulations, similarly generated with MadGraph/MadEvent, which will be described 
in more detail in the next section. 

Reconstruction of the simulated events is similar to |6| , with several modifications. We first 
demand the presence of at least one muon with pt > 30 GeV and |?7| < 2.5. We set the leading 
muon aside, and then deposit all other particles into an idealized calorimeter consisting of 
perfect energy-sampling cells of size At] x A0 = 0.1 x 0.1. (We do not apply a magnetic field.) 
We then cluster the cells using the Cambridge/ Aachen algorithm implemented in Fast Jet 
2.3.4 [25|. The clustering radius is picked according to the event's Ht (scalar-summed pt) 
measured in the semicylindrical region opposite the muon in 0, and with |?7| < 3.0: R = 
{0.8, 0.6, 0.4} for Ht > {500, 800, 1300} GeV.^ We only keep jets which are above pr = 50 GeV 
and with |?7| < 2.5. 

Next, we identify the candidate hadronic top-jet and fe-jet. The former is simply taken to be 
the highest-pr jet in the event. We require this jet to be above px = 500 GeV, and that it 



in the centra 
and ATLAS 



mrt of the calorimeter, |?7| < 1.5, where the granularity is finest in both CMS 
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This latter requirement focuses our attention on those top-jet candidates that 
are most suitable for top-tagging using the calorimeter, but in any case these jets should have 
the best mass resolution for simpler mass-based tags. The fc-jet we identify as the remaining 
jet closest to the leading muon, without application of an explicit b tag. The semileptonic 
top candidate is then the sum of this 6-jet candidate and the leading muon, with ^t folded 
in according to some prescription. However, we will attempt to get as far as possible without 
using ^T explicitly, postponing its introduction until section HVl One consequence of this is 
that whenever we refer to the px of the semileptonic top candidate below, we will actually be 
using the pt of the recoiling hadronic top candidate as a proxy. 

At this level of analysis, we reject about 99% of light quark jets and 97~98% of gluon jets, 
simply from the requirement of the hard muon. The exact numbers depend somewhat on px, as 
well as on the simulation. In particular, PYTHIA appears to have higher pass rates, by a factor 
of about 1.5. Prompt heavy-flavor jets of course pass with much higher efficiency, essentially 
determined by their branching fractions to muons. 



^ It is also possible to use a fixed "fat" clustering scale, but somewhat greater care will be required in order 
to remove jet activity uncorrelated with the top decays, particulary FSR off of the top itself. Our variable 
clustering radius exploits the fact that the angular separation between top decay products shrinks as the 



event energy scale increases. (For other methodology and uses of variable jet clustering radius, see |26[.) We 
also note that the 6-jet in semileptonic decays may be reclustered using a somewhat smaller scale in order to 
improve momentum and mass resolution, but we have not explored this. 
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FIG. 1: Distributions of the muon energy fraction z^ for 1 TeV and 2 TeV semileptonic top candidates 
reconstructed from LH chiral ti (purple), RH chiral ti (red), PYTHIA QCD (black), HERWIG QCD 
(grey), and Wjj (blue). 

C. Discriminator analysis and choice of nominal cuts 

To get an idea of how best to discriminate semileptonic boosted tops from light jets, we scan 
over signal and background efficiencies obtained by independent ID, one-sided cuts on the five 
variables discussed in subsection III At z^, Ai?b^, x^, m;,^, and mini-iso. To get a sense for 
how the effects of these cuts scale with top pt-, we look at candidate semileptonic top-jets at 
Pt — 1 TeV and aX px — 2 TeV. The normalized distributions for these variables are shown 
in Figs. [T]to[5l and their helicity-averaged discrimination curves using PYTHIA dijets in Fig. [61 
The discrimination curves for HERWIG are very similar, as can be inferred from the distributions 
of the individual variables. 

In this analysis, for sake of brevity, we have not individually distinguished light quark, 
gluon, and prompt heavy-flavor jets within the continuum dijet simulations described above. 
The results will therefore most directly apply to searches in tt, where dijets are the relevant 
QCD background. We note that the composition of jets analyzed is approximately 14% light 
quarks, 56% gluons, 18% prompt 6, and 12% prompt c for the 1 TeV PYTHIA samples. The 
composition shifts to 29% light quarks, 51% gluons, 11% prompt fe, and 9% prompt c for the 2 
TeV PYTHIA samples. The HERWIG samples have similar composition. 

From Fig. El we infer that x^ and mini-iso are the best individual discriminating variables, 
with the latter displaying the strongest discrimination. In fact, mini-iso appears to be such a 
good discriminator that we can apparently drive the background efficiency below the part-per- 
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FIG. 2: Distributions of Ai?^^ for 1 TeV and 2 TeV semileptonic top candidates reconstructed from 
LH chiral ti (purple), RH chiral ti (red), PYTHIA QCD (black), HERWIG QCD (grey), and Wjj 
(blue). 
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FIG. 3: Distributions of the mass-drop x^ (Eq. [T]) for 1 TeV and 2 TeV semileptonic top candidates 
reconstructed from LH chiral ti (purple), RH chiral ti (red), PYTHIA QCD (black), HERWIG QCD 
(grey), and Wjj (blue). 
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FIG. 4: Distributions of rrih^ for 1 TeV and 2 TeV semileptonic top candidates reconstructed from LH 
chiral ti (purple), RH chiral ti (red), PYTHIA QCD (black), HERWIG QCD (grey), and Wjj (blue). 



mil level while retaining more than 80% of the signal.^ However, it is far from clear that our 
modelling of the high mini-iso tail of the QCD jets is so accurate that we can fully trust this 
prediction. While the remarkably good agreement between PYTHIA and HERWIG in Fig. |5] is at 
least an encouraging indication that the physics might be under control, detector effects could 
also be very important. In particular, some of the tracks in the immediate vicinity of the muon 
could be missed or misreconstructed, making a QCD-induced muon look more isolated than it 
is in reality. We do not have the tools to address this issue here. Nonetheless, as is clear from 
Fig. El if even a fraction of mini-iso^s discriminating power survives in the full detector, it will 
remain the strongest variable. 

In section llVt we will perform a study of the ti resonance search reach of the LHC, and for 
this we need to choose a nominal mini-iso cut. Given our uncertainty above, we do not push 
this cut as aggressively as we could, but instead we use the somewhat conservative choice of 
mini-iso > 0.9. The efficiencies of this cut are given in the first row of Tables HI and HTl We find 
a few part-per-mil acceptance of QCD jets, with only about 10% loss of top-jet signal. This by 
itself is enough to bring the QCD backgrounds to the resonance search under good control. 

Even if the performance of mini-iso proves to be less than ideal in the full experimental 
environment, we still have several other discriminating variables with which it can be combined. 
To get a sense for how such a combination would perform, we explore the discriminating power 
of the remaining variables after application of the mini-iso cut at 0.9. The helicity-averaged 
discrimination curves are displayed in Fig. [71 



^ We note that this conclusion is independent of the hard parton which initiates the QCD jet. 
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FIG. 5: Distributions of the muon mini- isolation (Eqs. [2] and [3|) for 1 TeV and 2 TeV semileptonic 
top candidates reconstructed from LH chiral ti (purple), RH chiral ti (red), PYTHIA QCD (black), 
HERWIG QCD (grey), and Wjj (blue). 
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FIG. 6: Efficiency for PYTHIA QCD-jets (reconstructed as candidate semileptonic top-jets) versus 
efficiency for helicity-averaged top-jets at 1 TeV and 2 TeV, scanning over independent one-sided cuts 
described in the text. The variables used are z^ (short dash), Ai?^^ (dash-dots), x^ (long dash), tti^^ 
(dots), and mini-iso (solid). 
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FIG. 7: Efficiency for PYTHIA QCD-jets (reconstructed as candidate semileptonic top-jets) versus 
efficiency for lielicity-averaged top-jets, after tlie mini-iso cut, at 1 TeV and 2 TeV, scanning over 
independent one-sided cuts described in tlie text. The variables used are z^ (sliort dasli), Ai?^^ (dasli- 
dots), x^ (long dash), mf,^ (dots). The fluctuations in the plot are due to depleted statistics in the 
QCD samples. 



We see that ARbfj, and x^ are the best remaining variables. Though ARbfj, outperforms x^ 
for the highest signal acceptance, the latter ultimately appears to allow greater discrimination. 
For either variable, we find that we can achieve an additional factor of 2 ~ 3 reduction in 
the QCD efficiency with only a few percent loss of signal. To take advantage of this, we will 
use a nominal cut of x^ > 0.5, though we also note that a cut of ARb^ > 0.10 at 1 TeV and 
ARbf_i > 0.05 at 2 TeV would perform comparably. The results are summarized in Tables [land 
Hit first without the mini-iso cut, and then in combination.^ In order to test the sensitivity of 
these numbers to our particular choice of mini-iso cut, we also tried a looser cut of mini-iso 
> 0.75. We find that the signal efficiency is largely unaffected, and the background efficiency 
rises by a factor of about 6. This is still a factor of 4 better than x^ by itself. 

In summary, in this subsection we have performed a very simple discriminator analysis for 
QCD jets with embedded muons versus boosted semi-muonic tops, using the five variables z^, 
ARbfj,, Xfj,, mbfj,, and mini-iso. We find that the mini-iso variable offers the most promise, but 
can also be fruitfully combined with x^ or ARb^ for a factor of 2 ~ 3 improvement. We propose 
a baseline set of cuts of 

x^ > 0.5, (4) 



mini-iso > 0.9, 



We obtain comparable efSciencics for QCD jets originating from different species of hard partons. 
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TABLE I: Nominal efficiencies at 1 TeV. 





tL 


tR 


PYTHIA dijet 


HERWIG dijet 


Wjj 


mini-ISO > 0.9 


0.87 


0.93 


0.0038 


0.0023 


0.95 


X;j > 0.5 


0.87 


0.91 


0.0373 


0.0353 


0.96 


combined x^ and mini-iso 


0.83 


0.89 


0.0014 


0.0009 


0.93 


ARb^ X 1^ < 0.8 


0.97 


0.97 


0.9970 


0.9980 


0.45 


all combined 


0.81 


0.86 


0.0013 


0.0009 


0.39 



TABLE II: Nominal efficiencies at 2 TeV. 





tL 


tR 


PYTHIA dijet 


HERWIG dijet 


Wjj 


mini-iso > 0.9 


0.86 


0.93 


0.0026 


0.0024 


0.95 


Xf, > 0.5 


0.84 


0.88 


0.0405 


0.0445 


0.95 


combined x^ and mini-iso 


0.79 


0.86 


0.0012 


0.0010 


0.92 


ARbf^ X ^ < 0.8 


0.93 


0.96 


0.9936 


0.9945 


0.27 


all combined 


0.73 


0.82 


0.0011 


0.0009 


0.21 



which can potentially achieve part-per-mil background efficiency with less than 20% loss of 
signal, as indicated in Tables |T] and [TTl In section [TV] we will see that these cuts can effectively 
eliminate the QCD backgrounds to the ti invariant mass spectrum. 

III. VF-STRAHLUNG 

High-pT events with VT-bosons constitute the second major source of background. A W 
produced in close proximity to a jet looks practically identical to a boosted top. In fact, such 
a configuration becomes progressively more common as the momentum scale is increased well 
above the W mass, since quarks produced at such high energies can radiate Ws much like 
gluons or photons.^ These IV-strahlung emissions are similarly dominated by soft and coUinear 
regions of phase space, but with mw acting as a physical regulator. 

The extent to which this might pose a problem depends crucially on the probability of W 
emission. At pt ^ 'm^w, the emission probability from an individual quark line (multiplying by 



They may also radiate a Z. This case should be largely dealt with using an explicit dimuon Z veto. 
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the 50% chance that it is left-handed chirahty) can be estimated as 

1 C(2 2 PT 

P(iy-strahlung) ^ - — log . (5) 

4 71 mw 

This should be multiplied by the 11% branching fraction to muons. At pr = 1 TeV, the 
emission probability is about 2%, with 0.2% for emitting in the muon mode. While this is a 
small number, the rate of hard quark production at the LHC exceeds that of top production at 
equivalent energies by several orders of magnitude. Moreover, I^-strahlung becomes increas- 
ingly important at higher pr's, in part because of the log-squared growth, but mostly since 
valence quark scattering turns off more slowly than any other process. 

In this section, we will more carefully categorize this background, and consider how it can be 
ameliorated through simple cuts without significantly affecting the boosted top signal. However, 
as with the heavy flavor background, we postpone discussion of absolute rates until section llVt 
where we investigate the impact of VT-strahlung on the it invariant mass spectrum. 

With emission probabilities at the percent scale, IV-strahlung at TeV momenta is still 
a highly perturbative process. We model it at leading order with {W"^ — )■ ^^i^)jj using 
MadGraph/MadEvent 4.4. 13 (+PYTHIA) [20, Q at 14 TeV collision energy. To force the events 



into the px range of our analysis, and to avoid QCD singularities associated with soft/collinear 
regions for the two partons, we demand that the hardest parton is above 450 GeV, the second 
hardest above 50 GeV, and we place a fcy cut of xqcut > 30 GeV. We place no cuts on the 
lepton nor the neutrino. The events are subsequently showered, hadronized and reconstructed 
as in subsection IIIBI The final cross section passing our reconstruction criteria is fairly insen- 
sitive to the detailed values of our generator- level cuts.^*^ We stress that ly-strahlung cannot 
be accurately modelled with the simple 2 — ;■ 2 production of Wj dressed with QCD radiation, 
as would be obtained with PYTHIA or HERWIG standalone programs. In particular, our analysis 
here is essentially orthogonal to that in 8| . 

The first tactic usually considered for discriminating a Wj system from a semileptonic top 
is to apply an invariant mass cut of some kind. Indeed, this has been done using transverse 



mass m [i8|, ll9| , and full neutrino reconstruction in |l5|, ll6| . However, ^t could realistically 
turn out to be unreliable for precision top reconstruction in this regime, and we are led to 
consider alternative variables which use only visible particles. To this end, we capitalize on 
the angular features of ly-strahlung. Ws can be emitted at arbitrary angles, with a would-be 
coUinear singularity cut off at an angle characterized by niy/ /pt- Boosted tops, on the other 
hand, typically decay inside of a cone characterized by the somewhat larger angle mt/px- At 
a bare minimum, then, one can veto events where the W and the "6-jet" are too far apart 
to look like a boosted top decay. However, at first glance, these rough estimates would still 
suggest that the peak of VT-strahlung angles lies well within the distribution for top decay 



10 



Also, using a fully matched sample incorporating {W — ^ IJ- v)j shows no significant change. 
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angles. Fortunately, this picture is slightly misleading, and the PF-strahlung emission actually 
peaks at an angle approximately five times larger than mw/pr- 

This last observation is straightforward to understand. Soft and collinear singularities orig- 
inate at the diagrammatic level from intermediate propagators going almost on-shell. The 
quite large singularity from the squared propagator denominator is highly (but incompletely) 
suppressed by polarization and phase space terms in the numerator, leaving over the usual 
singularities in emission angle and energy. As soon as we move into a region of phase space 
where the denominator is significantly modified due to mw effects, these numerator terms act 
to shut off the emission rate.^^ In the small- angle limit the denominator takes the form 

z{l-z)9^p^ + 2(1 - z)p Uz^p^ + ml,- zp\ + m^, (6) 

where 9 is the final angle between the W and quark, p is the original quark momentum, and z 
is the fraction of this momentum carried by the W . (This definition of z extends down to zero, 
unlike fractional energy with a massive emission.) Emission rates look similar to those in the 
massless limit only when the first term in this equation dominates. 

For strictly collinear emission, the first term in Eq. (j6]) vanishes, leading to the usual dead- 
cone phenomenon, though now due to massive emissions instead of massive emitters. In fact, 
the first term is strictly smaller than the other two for all z when 9 < 2mw/p- For larger 
emission angles, the range of z for which the first term can dominate starts to gradually open 
up. Integrated over z, the emission rate as a function of 9 ultimately peaks at an angle 0(1) 
beyond 2my/ /p. The predicted offset of the ly-strahlung emission peak from the top peak is 
nicely illustrated in the distributions obtained from MadGraph, displayed in Fig. [HI This shows 
the Ai? between the muon and 6-jet candidate at reconstructed top pt of 1 TeV and 2 TeV, 
with the horizontal scale multiplied by pT/TeV to undo the shrinking of angles with top boost. 
Note that for pt ^ "m^w, the muon becomes an excellent tracer of its parent W^s flight direction, 
so it inherits the features under discussion. 

As a simple, experimentally-robust method of eliminating P^-strahlung events, we therefore 
propose placing an upper cut on ARf,^ scaled with l/pr- We call this "anti-isolation." Since 
genuine top quarks will experience shrinking angular scales as pt increases, the efficiency for 
tops stays roughly constant. The probability for a quark emitting a W near the dead cone 
region is also relatively insensitive to pt- However, the total W emission rate, integrating over 
all angles, increases logarithmically with pt (Eq. [5]). Since we deflne the efficiency for W- 
strahlung as the passing fraction of reconstructed VT-l-jets events (appropriately binned in pt), 
the cut naively becomes more effective at higher p^- 



^^ This story does not describe longitudinal emissions, which instead turn off due to Goldstone boson equivalence 
in exactly those regions where transverse emissions are enhanced. The longitudinal component is subdominant 
except at angles smaller than the angle where transverse emissions peak. It is never a large contribution in 
an absolute sense. 
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FIG. 8: Distributions of AR^^ x (pxt/TeV) for 1 TeV and 2 TeV semileptonic top candidates recon- 
structed from LH chiral ti (purple), RH chiral ti (red), PYTHIA QCD (black), HERWIG QCD (grey), 
and Wjj (blue). 

We list the relative acceptances obtained from our simulations in Tables [T] and [TTl placing 
the anti-isolation cut 

Ai?,^ X (pTt/TeV) < 0.8, (7) 

where for pTt we use the pt of the recoiling hadronic top candidate. ^^ With this cut, we can 
eliminate an 0(1) fraction of the VT-strahlung background while losing only a few percent of 
the signal. We also show the effect of combining this cut with the heavy flavor cuts. The two 
sets of cuts are fairly uncorrelated. 

To get some idea of how anti-isolation fares against more traditional mass-based cuts, we 
construct a highly idealized top invariant mass variable, incorporating the exact neutrino 3- 
vector. The distributions of this variable are displayed in Fig. [HI ^'^ No energy smearing has been 



^^ More general classes of events could have additional sources of missing energy, the simplest example being 
dileptonic it. Obtaining a measurement of the full semileptonic top px may then be difhcult or impossible. 
The obvious alternative is to instead use the total visible pT of the semileptonic top candidate, namely of 
the muon plus the nearby jet. We find that this provides comparable discrimination power. However, it 
is important to realize that in cases where the semileptonic top pt is not fully measured, we should also 
be worried about discriminating against VF-strahlung jets of different total px- We do not perform such 
comparisons here. 

^^ A small spike at m\Y is visible for the Wjj sample in the 1 TeV panel. This is from rare events where our 
reconstruction mistakes a hard photon radiated off of the muon as the candidate 6-jet. (Statistics are not 
high enough at 2 TeV to see this as cleanly.) Of course, such a feature would likely not appear with more 
realistic reconstruction criteria. 
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FIG. 9: Distributions of idealized m^^j^y (with perfectly-measured neutrino) for 1 TeV and 2 TeV 
semileptonic top candidates reconstructed from LH chiral ti (purple), RH chiral ti (red), PYTHIA 
QCD (black), HERWIG QCD (grey), and Wjj (blue). 



applied, so the spread in the reconstructed invariant mass for real tops is entirely due to particle 
sampling within the 6-jet. This is some combination of out-of-range particles, uncorrelated 
radiation from the collision, energy lost to neutrinos in semileptonic 5-hadron decay, and 
leakage of top-FSR into the 6-jet reconstruction. The last is the biggest effect as px increases, 
but note that our jet clustering size is coarsely shrinking with pt scale, so that we automatically 
try to stay in the top's dead cone at some level. In any case, our smearing of the reconstructed 
top mass peak is likely extremely conservative. To compare with anti-isolation, we construct 
discriminator curves for ARhf^xprp and m;,^,^ at 1 TeV and 2 TeV. The cuts start at large values 
and scan down to zero. We show the results of the scans in Fig. (TDl The performance is clearly 
comparable, and even slighly better for ARh^^xpx for most of the range. 

As with heavy flavor, more aggressive cuts are possible, but we have attempted to understand 
how much can be done with minimal sensitivity to detector effects and with minimal loss of 
signal. Specifically, we chose to sit near the bend of the background vs. signal efficiency curve 
in Fig. [TOl Other variables may be worth exploring, as well. In particular, it is still possible to 
fold in ^T to get a more complete picture of the kinematics. This may afford some additional 
discriminating power, though our own further investigations using the complete neutrino 3- 
vector (after application of the Ai?^^ x pj-^ cut) suggest that the potential gain may not be 
large. The relative momentum of the muon in the fx+b system (z^) is another simple variable |8|, 
but we do not find that it gives a distinctive distribution compared to real tops. In fact, the z^ 
distribution from IV-strahlung closely mimicks that of left-chirality tops, as we will see below. 
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FIG. 10: Efficiency for jets with VK-strahlung (reconstructed as candidate semileptonic top-jets) versus 
efficiency for helicity-averaged top-jets, scanning over independent one-sided cuts on ARb^xpxt (solid) 
and idealized m;,^^ (dashed). 



IV. SEARCHING FOR MULTI-TEV tt RESONANCES IN /i-^JETS 

The observations above suggest that it will be possible to identify boosted semileptonic 
tops with high purity while maintaining high efficiency. As an illustrative example of the 
effectiveness of our proposed minimal set of cuts, we present a simple search analysis for spin-1 
ti resonances in the /i+jets channel, with the assumption that the LHC will ultimately reach 
its design energy of 14 TeV. We restrict the discussion to resonances with pure left-handed or 
pure right-handed couplings, in order to highlight possible chirality biases in the reconstruction 
and to address the possibility of measuring these couplings independently. We perform our 
analyses for narrow resonances such as from weakly-coupled models, as well as for 15%-width 
resonances such as would arise in strongly coupled models where top is partially composite. 

Event reconstruction follows the same logic as in subsection IIIBI For events passing the 
reconstruction, we further demand that the semileptonic top candidate satisfies our nominal 
heavy flavor and VT-strahlung cuts, as in Eqs. |l]and[7l respectively. 

Until this point, we have been treating all energy measurements as exact, and have been 
explicitly avoiding the use of missing energy for reconstruction of the full neutrino momentum. 
Here, we incorporate smearing of particle energies and a simplistic neutrino reconstruction, in 
order to roughly model their effects on the reconstructed resonance peaks. 

We smear jet energies according to the CMS physics TDR 27l |: 

a{E) 5.6 GeV 1.25 GeV^/^ 



E 



E 



E 



0.033. 
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Technically, this formula only applies to iterative cone jets with R = 0.5, but we do not 
anticipate any major change when going over to C/A jets of comparable size. Using the CMS 
parametrization is the conservative choice, since the ATLAS hadronic calorimeter is expected to 



have better resolution 



28 



The first term also incorporates fluctuations from particle sampling 



within the jet area, which are already implicit in our jet construction. 



To smear the muon energy, we use the parametrization from the CMS muon TDR 29 1 



increasing the coefficient by 25% to better match the resolution curves presented in the more 



recent physics TDR 271 ]: 



a(E) E 

This assumes that global muon reconstruction is possible within TeV-scale top-jets. In principle, 
stiffer muon tracks should be easier to trace from the outer detectors into the inner detectors, 
and, as we have emphasized, muons from top decay will be mini-isolated at tracker level. 

We define ^t to balance the leading reconstructed objects in the event after energy smearing. 
We include the hadronic top-jet, the 6-jet, the muon, and the leading remaining jet, if there are 
any additional jets found. ^^ We reconstruct the pz of the neutrino by merely assigning it the 
same rj as the muon, rather than attempting to impose a W mass constraint. The ti system is 
then simply the sum of the reconstructed hadronic and semileptonic tops.^^ In Fig. [11] we show 
the effects of our reconstruction and smearings on narrow and 15%- width spin-1 resonances with 
M = 3 TeV. The plot includes our nominal procedure, as well as one incorporating an exactly 
measured neutrino 3-vector, for comparison. The instrumental width is clearly dominated by 
our jet and muon energy smearing, and not the missing energy reconstruction. 

Further purification of the signal can be achieved by applying cuts on the hadronic top-jet. 
The hadronic calorimeter top-tag of |6| is an obvious option, provided we are willing to tolerate 
a factor of > 2 loss of signal. As an alternative, we also consider a looser "top-tag" in the form of 
a simple top-mass cut. A hadronic top-jet candidate passes the cut if its (unsmeared) invariant 
mass is between 120 GeV and 300 GeV. This cut passes (80 ~ 90)% of top-jets, (20 ~ 30)% of 
light quark jets, and (30 ~ 50)% of gluon jets, depending on pt- Figure \T2\ shows the leading- 
order backgrounds in the ti invariant mass spectrum using our full reconstruction and cuts, 
including both methods of hadronic top-tagging.^^ In order to maintain good statistics on the 



^^ This jet would usually come from hard ISR or FSR, and incorporating it reduces the occurence of outliers in 

the mass spectrum due to badly mismodelled fix- 
^^ A more sophisticated analysis would also identify possible FSR from the tops before they decay. For example, 

one could simply incorporate the leading jet within some reasonable AR from either of the reconstructed tops. 

Given the level of our estimated smearing, we do not find that this procedure offers significant improvements 

in our final resonance lineshapes. However, incorporation of top-FSR would be very useful to study in a more 

realistic analysis. 
^^ In [6[ , we and our collaborators presented top-tagging efficiencies up to pr = 2 TeV. For the current study, 

we also consider top-jets at even higher pr- For example, at pT = 3 TeV, the efficiencies for tops, light 

quarks, and gluons are about 2%, 0.2%, and 0.5%, respectively. The efficiency for finding tops decreases 
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FIG. 11: Reconstructed ti invariant mass spectra of narrow (left) and 15%-widtli (right) spin-1 reso- 
nances with M = 3 TeV, coupled to t/j. We ignore any interference effects with continuum production. 
The black histogram represents our nominal treatment of missing energy, with error bars from monte 
carlo statistics. The continuous grey histogram (with error bars suppressed) represents a more ideal- 
ized reconstruction using the exact neutrino 3-vector. 



dijet and VT-strahlung backgrounds, we utilize weighted tags on quark and gluon jets (binned 
in pt), based on studies of independent PYTHIA dijet samples. 

For our nominal analysis, we use the hadronic top-mass cut instead of the full top-tag, in 
order to keep good signal efficiency across the invariant mass spectrum. The price we pay for 
this high efficiency is that the reducible VT-strahlung background begins to dominate above 
about 2.5 TeV invariant mass. However, signal reach across the entire invariant mass spectrum 
is still better than the full top-tag analysis. Obviously, a more efficient top-tag or high-pr 6-tag 
could offer further improvements. 

We can already get some sense for the efficiency of our cuts to pass genuine top pairs by 
looking at the irreducible ti background in the left panel of Fig. [121 Over half of the recon- 
structed yU.+jets events survive the cuts. We also plot, in Fig. [131 the final efficiencies of narrow 
chiral spin-1 resonances decaying to ti, normalized with respect to the total crxBR(tt). Perfect 
efficiency for our analysis would correspond to the /i+jets branching ratio, or approximately 
15%. Our procedures achieve efficiencies of close to 10%.^^ 



dramatically due to its decay products falling into adjacent or identical calorimeter cells. We expect that 
more sophisticated techniques, incorporating information from the electromagnetic calorimeter and tracker, 
could likely improve the situation for tops with multi-TeV transverse momenta. 
The slight difference between the efficiencies for reconstructing left-chirality and right-chirality resonances 
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FIG. 12: Backgrounds in the ti invariant mass spectrum at leading order: PYTHIA dijet (black), Wjj 
(blue), and continuum ti in the /z+jets channel (red). Reconstruction of the event invariant mass is 
as described in the text. The left panel displays events in which the hadronic top candidate passes a 
simple top mass cut (also described in the text). The right panel displays events in which the hadronic 
top candidate passes the top-tag of g]. In each panel, the dashed histograms display the background 
spectra before application of any cuts on the hadronic or leptonic top candidates beyond the basic 
reconstruction criteria of subsection III B[ Error bars reflect monte carlo statistics. 



Given the high reconstruction efficiency and manageable level of reducible background, we 
can achieve promising sensitivity to new resonances. We display the estimated reach for 
(TxBR(tt) in Fig. [TH To perform the estimate, we simply count events within coarsely op- 
timized mass windows constructed about each resonance. For the narrow resonances, we take 
rritt within ±10% of the physical resonance mass, and for the 15%-width resonances we take 
±15%.^® A given production cross section is considered discoverable if the expected number of 
signal plus background events is at least 5a above the background-only prediction according 
to Poisson statistics (asymptotically, Ns/^/Nb > 5) and Ns > 10. This estimate essentially 
represents the most optimistic signal reach, assuming well-controlled systematic errors. To in- 
dicate the possible relevance of background uncertainties, we also plot the o"xBR(tt) at which 



mostly owes to the semilcptonic top reconstruction efficiency, as in Tables U and |TI1 The hadronic top-mass 
cut (as well as the full hadronic top-tag) is relatively insensitive to top polarization. 
^^ These windows are actually somewhat asymmetric about the reconstructed resonance peaks, which are a 
few percent below the original mass. Since the background spectrum is a falling function, the windows are 
therefore slightly biased towards the regions with higher signal-to-background ratio. 
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FIG. 13: Nominal final efficiencies for narrow spin-1 resonances decaying into ti. (The /i+jets branch- 
ing fraction sets an upper bound of about 15%.) Open circles represent resonances with purely 
left-handed couplings, and solid circles represent resonances with purely right-handed couplings. 



Ns/Nb = 1 within the signal windows. Superimposed on the plots are several model predic- 
tions. 

We highlight our conclusions for the lightest KK gluon of |2|, a particle which can also be 
viewed as an excitation of a str ong ly-cou p led sector which generates a composite Higgs boson. 



Assuming the model curves of 16| or 17|], respectively, we find that a 100 fb ^ run of 14 TeV 



LHC can potentially discover a KK gluon up to about 4.5 TeV or 5.0 TeV. In previous studies, 
the discovery reach for this particle had been estimated to be less than 4 TeV [l6|, [l9| at 100 
fb-i. 



As pointed out in 16|, ll7| , heavy composite resonances may have enhanced couplings to tn 
versus to t/,, and this bias may be experimentally observable in semileptonic decays. A general 



and polarization 
34| . Here, we do not 



analysis of simple boosted top "polarimeter" variables was performed in 
effects were further explored in the context of boosted hadronic tops in 
attempt to quantify the quality of polarization measurements for different models. However, 
to illustrate the extent to which polarization information is preserved by our procedures, we 
display in Fig. [15] the distribution of the polarization-sensitive variable z^ in the mass window 
rritt = [2700, 3300] GeV. The samples compared include the narrow 3 TeV left- and right- 
chirality resonances, as well as the ly-strahlung background. The z^ distribution from the 
continuum ti background is close to the average between the two chiral resonances, and we 
have omitted it for clarity. It is worth noting that if we had chosen to use z^ (or an analogous 
quantity) as a discriminator variable for eliminating backgrounds, this polarization-sensitive 
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FIG. 14: The minimum discoverable (TxBR(tt) for narrow (left) and 15%-width (right) multi-TeV 
spin-1 resonances. Open circles are pure left-handed chiral resonances, and solid circles are pure right- 
handed. Luminosities displayed are 1 fb~^ (red), 10 fb^^ (black), and 100 fb^^ (blue). The grey 
line represents the o"xBR(tt) at which Ns/Nb = 1 within the signal windows (for the right-chirality 
resonances only). Also displayed are several models. In the narrow resonance plot (left): a pure B — L 



gauge boson with Qr-l = 0.2 



model, with y = 5 



the KK gluon of 



a 



31 



301 1 (piiik dot-dashed), and a KK gluon of the Little Randall Sundrum 



32( 1 (purple dashed). In the 15%-width resonance plot (right): estimates for 



taken from 



16( 1 (pink dot-dashed) and [17| (purple dashed). 



Q 



structure would have been degraded. 

We can make two immediate observations regarding polarimetry measurements. First, left- 
and right-chirality resonances are clearly distinguishable from each other, and nearly reproduce 
the distributions predicted in [33|.^^ If the signal-to-background ratio is favorably large, and 
statistics are high, discriminating between the two cases should be straightforward. However, 
we can also observe that the final z^ distribution of H^-strahlung events closely mimicks that 
of ti- Since W^-strahlung is the main background for resonances above about 2.5 TeV in our 
nominal analysis, care would need to be taken in making reliable polarization measurements. 
Of course, the VT-strahlung contamination could be significantly reduced by applying stricter 



^^ The predicted t^ shape in |33j is actually slowly monotonically rising as z^ approaches one. However, our 
reconstruction methods in subsection III Bl demand the presence of a hard 6-jet candidate to pair with muon. 
This shuts off the rate near z^ ~ \. A more permissive reconstruction would likely yield a more distinctive 
z^ distribution for tn- 
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FIG. 15: Normalized distributions of the polarization-sensitive variable z^ for the narrow 3 TeV 
left-chirality resonance (purple), narrow 3 TeV right-chirality resonance (red), and Wjj background 
(blue), all reconstructed within the ti invariant mass window m^f = [2700,3300] GeV. 

cuts, such as a full hadronic top-tag on the recoiling top candidate (Fig. [12]), though at an 
additional cost of signal efficiency. 

V. CONCLUSIONS AND OUTLOOK 

Efficient identification of boosted top quarks will be important for various well-motivated 
new physics searches at the LHC In this paper, we have explored techniques to optimize 
boosted top identification for the cleanest case of semileptonic decays in the muon mode, 
focusing on observables that should be relatively robust against detector effects. In particular, 
we have avoided the use of missing energy and 6-tags, in contrast to more standard search 
strategies. Nonetheless, we have found excellent background rejection while keeping very high 
signal efficiency, as evidenced by the rates in Tables [I] and [ITl 

The most subtle and ubiquitous background to boosted semileptonic tops is ordinary QCD 
jets with hard embedded muons, dominantly from heavy flavor decays. For TeV-scale jets, 
the heavy flavor usually originates from gluon splittings in the parton shower, as opposed to 
prompt production in the hard interaction. We have compared several candidate discriminator 
variables, and found that the most powerful is a tracker-level "mini-isolation" using a small cone 
which shrinks with increasing muon px- This is in contrast to more traditional lepton isolation, 
which tallies tracker and calorimeter energy within a cone comparable to a flxed jet clustering 
radius, e.g., R = 0.4. Combining with an additional cut on the muonic mass-drop variable x^ 
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of (8| (or the AR between the muon and the center of its associated jet), signal-to-background 
can potentially be purified by almost 1000:1, with only 0(10%) loss of signal. This attenuation 
of background is in addition to the initial requirement that the QCD jet contains a hard muon, 
which occurs with percent-scale probability. For resonance searches in the ti invariant mass 
spectrum, this level of QCD jet rejection reduces the dijet contribution to essentially negligible 
level. The dijet background will remain negligible or modest if even a small fraction of our 
claimed discrimination power can be achieved at the LHC 

Jets containing heavy flavor produce muons through off-shell H^-boson emission. However, 
it is also possible for a light quark to directly emit an on-shell H^-boson, i.e. to undergo W- 
strahlung. These emissions are coUinear-enhanced for TeV-scale jets, occurring with percent- 
scale probability. In the absence of fe-tagging, a light quark jet with IV-strahlung can look 
very similar to a top-jet. Nonetheless, top-jets are more kinematically constrained than W- 
strahlung, and we have investigated a very simple cut that appears to capture most of the 
available kinematic discriminating power: an "anti-isolation" cut on the maximum AR between 
the muon and its accompanying jet. This can achieve 0(1) purification of signal-to-background 
with a few percent loss of signal. 

As a case study, we have investigated the performance of a minimal set of cuts in the con- 
text of a multi-TeV ti resonance search at a 14 TeV LHC. The major remaining backgrounds 
are irreducible ti continuum and reducible PF-strahlung, with the latter becoming dominant 
above about 2.5 TeV. The final resonance reconstruction efficiencies, including the branching 
fraction to yU-|-jets, are close to 10%, suggesting much better sensitivity than has been previ- 
ously estimated 16|, [l9(]. In particular, we find that a warped KK gluon could be discovered at 
masses above 4 TeV with 100 fb~^ of data. Left-chirality and right- chirality resonances can be 
reconstructed with similar efficiency, and our cuts preserve much of the polarization-sensitive 
kinematics. However, the remaining VT-strahlung background mimicks left-chirality tops, pos- 
sibly complicating polarization measurement for high mass resonances unless further cuts are 
applied. 

Our study leaves open several important issues. The most pressing is the possible role of 
detector effects. The simple anti-isolation cut for ly-strahlung is likely the easiest to implement 
without a detailed understanding of the detector. However, construction of the mini-isolation 
variable will require tracking to work quite reliably within the core of a jet, which could be 
a rather crowded environment. We suspect that this possible tracking breakdown would not 
prove to be an insurmountable problem. Genuine top-jets are characterized by a muon slightly 
offset from the bulk of the jet activity, and should be much easier to track and mini-isolate. 
Our main worry then becomes whether we can obtain a reliable measure of the activity around 
the muons within QCD jets. But even if detailed momentum measurements become difficult 
in this case, the nearby density of tracking hits can likely serve as an effective supplementary 
discriminator. Also, we have not modelled possible tracker signals originating from neutral 
pions, due to photons showering in the tracker material. This additional activity will likely 
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make muons from QCD even easier to discriminate. Ultimately, determination of the true 
effectiveness of mini-isolation or analogous discriminators will require more detailed detector 
simulations and analysis of high-energy LHC data. Our particle-level results suggest that such 
pursuits are highly motivated. 

We can also consider various avenues for improvement of boosted semileptonic top identi- 
fication, in particular the incorporation of 6-tags, the use of ^t for detailed kinematic recon- 
struction, and the possibility of using the electron decay mode. 

We have neglected 6-tagging since it may be difficult to implement with good efficiency at 
high, pt, and in any case it cannot be reliably modelled at the level of our analysis. However, even 
a loose displaced vertex tag could be useful for improving discrimination against H^-strahlung 
jets, which are mainly associated with light quarks. For example, an additional 0(1) reduction 
of the py-strahlung background would be quite beneficial for searches in the ti invariant mass 
spectrum, assuming that the signal efficiency can be kept high. 

Missing energy, on the other hand, may be of limited utility. We have managed to essentially 
eliminate heavy flavor backgrounds without ever referencing ^t- For PF-strahlung events, 
after passing our anti-isolation cut the kinematics are already extremely similar to boosted 
tops. Given the susceptibility of ^t to fluctuations in visible energy measurements, it will be 
surprising if signiflcantly better kinematic discrimination can be achieved. 

We have focused entirely on the case of semileptonic top decays into muons, but we might 
also hope to recover a signal in decays into electrons. Work at ATLAS suggests that electron 



reconstruction within top-jets should be possible with 0(1) efficiency |22|, with well-controlled 
QCD backgrounds. We may therefore hope to ultimately achieve somewhat greater signal reach 
beyond a muon-only analysis by incorporating these decays. They could be especially relevant 
in classes of events where we search for more than one boosted top decaying semileptonically. 
Finally, for searches involving both boosted semileptonic and hadronic tops, we note that 
hadronic top-tagging can also have a significant impact. In particular, it serves as an addi- 
tional way to eliminate ly-strahlung background from the ti invariant mass spectrum in the 
/i-|-jets channel, by top-tagging the recoiling jet. We already saw the potential for the hadronic 
calorimeter top-tag of |6| in the right-hand panel of Fig. [121 albeit at high cost to the signal for 
large invariant masses. More sophisticated top-tagging techniques are clearly worth investigat- 
ing. Also, beyond the /x-|-jets search, hadronic top-tagging opens the possibility of performing 
an all-hadronic search for tt resonances p, [ij]. While /x-|-jets is still the optimal resonance dis- 
covery mode, owing to its modest contamination from QCD background, reproducing a claimed 
discovery in the all-hadronic channel would serve as a powerful cross-check. 
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